Robust Finite-State Controllers for Uncertain POMDPs
نویسندگان
چکیده
Uncertain partially observable Markov decision processes (uPOMDPs) allow the probabilistic transition and observation functions of standard POMDPs to belong a so-called uncertainty set. Such uncertainty, referred as epistemic captures uncountable sets probability distributions caused by, for instance, lack data available. We develop an algorithm compute finite-memory policies uPOMDPs that robustly satisfy specifications against any admissible distribution. In general, computing such is theoretically practically intractable. provide efficient solution this problem in four steps. (1) state underlying nonconvex optimization with infinitely many constraints. (2) A dedicated dualization scheme yields dual still but has finitely (3) linearize (4) solve resulting finite linear program obtain locally optimal solutions original problem. The formulation exponentially smaller than those from existing methods. demonstrate applicability our using large instances aircraft collision-avoidance scenario novel spacecraft motion planning case study.
منابع مشابه
Sparse Stochastic Finite-State Controllers for POMDPs
Bounded policy iteration is an approach to solving infinite-horizon POMDPs that represents policies as stochastic finite-state controllers and iteratively improves a controller by adjusting the parameters of each node using linear programming. In the original algorithm, the size of the linear programs, and thus the complexity of policy improvement, depends on the number of parameters of each no...
متن کاملSynthesis of Hierarchical Finite-State Controllers for POMDPs
We develop a hierarchical approach to planning for partially observable Markov decision processes (POMDPs) in which a policy is represented as a hierarchical finite-state controller. To provide a foundation for this approach, we discuss some extensions of the POMDP framework that allow us to formalize the process of abstraction by which a hierarchical controller is constructed. Then we describe...
متن کاملPermissive Finite-State Controllers of POMDPs using Parameter Synthesis
We study finite-state controllers (FSCs) for partially observable Markov decision processes (POMDPs). The key insight is that computing (randomized) FSCs on POMDPs is equivalent to synthesis for parametric Markov chains (pMCs). This correspondence enables using parameter synthesis techniques to compute FSCs for POMDPs in a black-box fashion. We investigate how typical restrictions on parameter ...
متن کاملFinite-State Controllers Based on Mealy Machines for Centralized and Decentralized POMDPs
Existing controller-based approaches for centralized and decentralized POMDPs are based on automata with output known as Moore machines. In this paper, we show that several advantages can be gained by utilizing another type of automata, the Mealy machine. Mealy machines are more powerful than Moore machines, provide a richer structure that can be exploited by solution methods, and can be easily...
متن کاملBounded Finite State Controllers
We describe a new approximation algorithm for solving partially observable MDPs. Our bounded policy iteration approach searches through the space of bounded-size, stochastic finite state controllers, combining several advantages of gradient ascent (efficiency, search through restricted controller space) and policy iteration (less vulnerability to local optima).
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i13.17401